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We estimate how clustering in large-scale redshift surveys can constrain various cosmological parame- 
ters. Depth and sky coverage of modern redshift surveys are greater than ever, opening new possibilities 
for statistical analysis. We have constructed a novel maximum likelihood technique applicable to deep 
redshift surveys of wide sky coverage by taking into account the effects of both curvature and linear 
"^■< velocity distortions. The Fisher information matrix is evaluated numerically to show the bounds derived 

iy) ■ from a given redshift sample. We find that intermediate-redshift galaxies, such as the Luminous Red 

Galaxies (LRGs) in the Sloan Digital Sky Survey, can constrain cosmological parameters, including the 
cosmological constant, unexpectedly well. The importance of the dense as well as deep sampling in 

s ! ' designing redshift surveys is emphasized. 
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CO ' 1. INTRODUCTION 

CO 

Recently, independent cosmological observations have turned out to be consistent with a canonical model of the structure 
formation based on the assumption that the main ingredients of our universe are the cold dark matter (CDM), and the 
. dark energy component such as a cosmological constant. Introducing the unknown dark matter and the peculiar dark 
energy could have been thought of as somewhat artificial at the time when observations were not constraining enough, 
now this model has become more and more convincing with remarkable observational support. 

To make a cosmological model truly convincing, it is important to make sure that any possible observations are consistent 
O ' with that model. This can be accomplished by determining cosmological parameters as precisely as possible from each 
observation, and then comparing the results with one another. So far the individual observations were only able to 
^ constrain the parameters into some region in a multi-dimensional parameter space. Therefore, it is quite common to 
\ combine different observations to obtain the tight constraint on the parameters. A striking example is a combination of 
. fluctuations in cosmic microwave background (de Bernardis et al. 2000) and a Hubble diagram of supernova la (Riess et 
J^j ' al. 1998; Perlmutter et al. 1999) which is considered to be the prominent evidence of the current canonical model. 

However, one can test the consistency of the theory only when each observation can independently constrain the 
■ parameters in a sufficiently small region. In this paper, we explore the capabilities of redshift surveys in this respect. In 
redshift surveys, there are several degrees of freedom in planning survey strategies: the area and the depth of a survey, 
and the choice of specific targets for the spectroscopic measurements. Each of these affects the analysis in a different and 
complex fashion. Thus, it is important to estimate the impact of a given strategy on the cosmological analysis of the data. 

In this paper, we apply our considerations to the Sloan Digital Sky Survey (SDSS, York et al. 2000). It is not only the 
largest ongoing redshift survey, but it also contains three different types of spectroscopic targets within the same area of 
the sky that can be used to infer cosmological information. 

In deep, wide redshift surveys like SDSS, the apparent clustering properties in observable redshift space are complicated 
because of the redshift-evolution and redshift-distortion effects. In linear regime, with a standard initial Gaussian density 
field, the correlation function £(r) or the power spectrum P(k) completely characterizes the statistical property of the 
density field. Spatial homogeneity and isotropy in real space guarantees these to be one-dimensional functions of the 
scale, r or k. So it is usual to estimate the real-space power spectrum with certain correction methods or ansatz from 
observed redshift-space data, in order to compare with the theoretical prediction of various cosmological models (e.g., 
Davis & Peebles 1983; Feldman, Kaiser & Peacock 1994). 

However, the redshift-evolution and redshift-distortion also have invaluable information on cosmology. The redshift- 
evolution of the density fluctuations depends on the growth factor which is a function of the density parameter Qm and the 
normalized cosmological constant parameter Q\. It is also a function of the galaxy bias b which contains the information 
on the unknown galaxy formation process. The redshift-distortion depends on the density- velocity relation which is a 
function of il^f jb (Kaiser 1987), and also on 17m and fl\ through geometrical distortions (Alcock & Paczyhski 1979; 
Ballinger, Peacock & Heavens 1996; Matsubara & Suto 1996). 

When our main purpose is not only determining the power spectrum of initial fluctuations, but also constraining above 
parameters, it is desirable to directly analyze the data to obtain cosmological information without any intermediate 
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statistical quantities like the power spectrum. Since the apparent clustering in redshift space is quite different from that 
in real space, we can take advantage of the maximum-likelihood method to directly constrain cosmological models from 
the apparent clustering data in redshift space. 

In our previous work (Matsubara & Szalay 2001), we focused on how near-future redshift surveys can constrain the 
cosmological constant by the maximum-likelihood method. Since the redshift surveys become larger and larger, even 
the multi-dimensional parameter space can be constrained by redshift surveys alone. Combining with other cosmological 
observations like the cosmic microwave background fluctuations, Hubble diagram of the type la supernova, and so on, one 
can constrain the parameters more accurately. However, independent observational constraints can serve as a consistency 
check of the fundamental cosmological model. In this respect, we investigate how one can constrain the multi-dimensional 
parameter space of the standard CDM model with cosmological constant from redshift surveys, extending our previous 
work. 



2. MAXIMUM-LIKELIHOOD METHOD FOR THE APPARENT REDSHIFT-SPACE CLUSTERING 



In maximum-likelihood methods, we directly deal with the density fluctuations in redshift space. In order to do this, first 
we place smoothing cells in redshift space and consider the number of objects rii in each cell i. Each count represents the 
smoothed density field contaminated by shot noise. For given mean number densities, rii = (rii), the density fluctuations 
&i = Tii/fii — 1 are the fundamental quantities to be analyzed. In the following, the most important statistical quantity 
is the correlation matrix, CV, = (aiCij). This matrix is related to a continuous two-point correlation function in redshift 
space £(s,s') = (S(s)S(s')), where 8(s) is the underlying continuous density contrast which survey objects trace. The 
relation between the correlation matrix and the correlation function is given by 
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where W^(s) is the smoothing kernel in redshift space, normalized as J d 3 sW^ s \s) = 1 and n(s) is the mean number 
density. The last term represents contribution from the shot noise effect. In linear regime, the two-point correlation 
function in redshift space is analytically given, including high redshift effect. 

Although the general form is fairly complicated (Matsubara 2000), the distant-observer approximation (Kaiser 1987; 
Matsubara & Suto 1996), which is applicable in many cases, simplify the analysis below. The corresponding formula 
(Matsubara & Suto 1996; Matsubara 2000) is given by 
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where z\ and z 2 are the redshifts of the two points, fcy is the line-of-sight component of the wave number, P(k) is the 
linear mass power spectrum at z = 0, D(z) is the linear growth rate normalized as D(0) = 1, b(z) is the bias factor at 
redshift z, and j3(z) is the redshift distortion parameter, which is accurately fitted by the redshift-dependent mass density 

parameter Qm(z) and normalized cosmological constant as (3(z) ~ {fl^ 7 (z) + Cl\ (z) [1 + 0,m(z) / '2]/ 70} /b(z) (Lahav 
et al. 1991; Lightman & Schechter 1990). The vectors X\ and x 2 are the comoving positions of the two points which are 
labeled by Si and s 2 in redshift space (see Matsubara & Suto 1996). 

Except for the nearby universe, the comoving distance is not proportional to the redshift so that the smoothing kernel 
in redshift space is distorted along the line of sight. In a distant-observer regime, a line-of-sight component s\\ and 
a transverse component s± of redshift-space separation s are given by 



sii = H{z)x\\ 



s± 



d A (z 



(2.3) 



where x\\ and x± are corresponding line-of-sight and transverse components of real-space separation, respectively, and z is 
the mean redshift. We have chosen the unit system H = 1 here for simplicity. This unit system simplifies the equations in 
this section. We mention that another unit system for length is introduced in §4 for a practical reason which is explained 
there. In the above transform, the redshift-dependent Hubble's parameter and the comoving angular diameter distance 
are explicitly given by 



H(z) = y/Q u {l + z) 3 + (1 - O m - O a )(1 + zf + Q A , 

{-K)- 1 ' 2 sinh U-Kf^xiz)] (K < 0), 
d A (z)={ X (z) ' (K = 0), 



(2.4) 
(2.5) 

/o *(*')' (2 ' 6) 
are the curvature of the universe and comoving distance to the redshift z. Thus, the transform from the comoving space 
to the redshift space is the function of and 



where 



K 



K- 1 ' 2 sin K x ' 2 X {z) 

X(z) = 



M 



ft A, 



(K > 0), 
dz 1 



3 



As a result of the anisotropic transform of equation (2.3), the spherical kernel in redshift space, for example, ends up 
with oscillating two-dimensional numerical integration (Matsubara & Szalay 2001) in evaluating the correlation matrix 
Cij . Unfortunately, such numerical integration increases CPU time and decreases accuracy. This is crucial for the analysis 
of the real data, because the position of kernels can hardly be regular so that one has to calculate the correlation matrix 
pair by pair. The shape of the kernel is not important for largely separated pairs, but it affects nearby and identical pairs. 

Although one can not exactly know what is the distortion unless the parameters D,m and are fixed, it turns out to be 
a good approximation to set an approximate spherical kernel in real space with approximate values of these parameters. 
In fact, we found that the latter approximation is better than doing the exact numerical integration which inevitably 
has integration errors. When the parameters f^M and Q\ should also be determined in real analyses, one can iterate the 
analysis to use better estimates for those parameters, but approximate values are good enough. Thus, suppose that we 
can approximately set a spherical kernel in real space, so that we have W^(s) = J~ 1 Wr(\x\), where J = d(s)/d(x) is 
the Jacobian from x-space to s-space, and R is the (effective) smoothing radius. In the distant-observer approximation, 
J(z) = z 2 H(z)/d A 2 (z). 

This approximation dramatically simplifies the evaluation of the correlation matrix. In fact, when the kernels are 
spherical in real space, at least approximately, equations (2.1) and (2.2) give 
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where bi = b(zi), Di = D(zi), J(zi) — Ji, etc., and W(kR) is the three-dimensional Fourier transform of the spherical 
kernel IUr(|:e|). In our notation, hi is the mean number density in redshift space so that Jihi is the mean number density 
in real space. We assume that the variation of the selection function within a kernel is negligible. The above equation 
means that the distant-observer redshift-distortion is commutable with spherical smoothing in real space, comparing with 
equation (2.2) besides shot-noise term. Angular integration of the first term can be analytically performed (see Hamilton 
1992; Matsubara & Suto 1996; Matsubara 2000), resulting in 
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where Pi = (3{zi), (3j = f3(zj), Xij = \xi — Xj\ is the comoving separation between the centers of the kernels, fiij — x^/xij 
is the direction-cosine of the separation relative to the line of sight, ji is the spherical Bessel function, and Pi(^) is 
the Legendre polynomial. We also define the quantity, 

r°° k 2 dk 

&(x,R)= -—Mkx)W 2 (kR)P(k). (2.9) 
Jo 27r 

We need to numerically integrate the equation (2.9). Once we make a table of the integrated results for various x and 
R of interest, we can use this table to interpolate which makes the evaluation of the correlation matrix dj a really 
fast operation. This method is very effective because the number of elements of a correlation matrix is the square of the 
number of cell sites (iV ccll ) 2 , where 7V ccll ~ 1, 000-10, 000, or more. We tried exact 2-dimensional integrations for identical 
and nearby cell-pairs, and found that they agree with the above approximation within the integration errors, even though 
the assumed Qm and £l\ are different from the true values by a factor of about 0.5. 

The shot noise term is diagonal when the kernels do not overlap each other, as one can see from the equation (2.1). 
Using a finite- volume kernel without any overlapping region, the last term of equation (2.1) or (2.8) is given by 
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(2.10) 

most popular example of the 



finite-volume kernel, in which case the above shot noise term simply reduces to Sij/N i: where Ni — 4irR 3 Jihi/3 is the 
expected mean number count in a cell. 

Although we only use the top-hat kernel in this paper, it is also useful to consider another possibility as digression, since 
this fast numerical approximations will also be used in the analysis of the real data. The top-hat kernel is not a smooth 
function, thus it is sometimes desirable to have an smoother kernel. An example of the smoother, yet finite- volume kernels 
is the Epanechnikov kernel, which is defined by Wr(x) = 15(1 — x 2 / R 2 )Q(R — x)/(8irR 3 ). While the top-hat kernel is 
discontinuous on the edge x = R, the Epanechnikov kernel is continuous. The derivative is still discontinuous on the edge. 
One can also use the m-weight Epanechnikov kernel defined by 



K m (x;R) 



(2m + 3)!! 
m\ 2 m + 2 nR 3 



(2.11) 
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The top-hat kernel and the Epanechnikov kernel are the special cases of this general Epanechnikov kernel with m = 
and m = 1, respectively. The (m — l)-derivative of the m-weight Epanechnikov kernel is continuous on the edge. The 
larger the rank in the general Epanechnikov kernel is, the more regular it is on the edge. However, the effective volume of 
the kernel is smaller for a larger-weight kernel when a smoothing length R is fixed. Taking limits m — ► oo, R — > oo with 
mT 1 / 2 R fixed, one can show that the higher-weight Epanechnikov kernel reduces to a Gaussian kernel with a variance 
R/^/2m. The Fourier window function for the m-weight Epanechnikov kernel has a simple form, 

W m (kR) = jf 00 d* X e-^K m ( X ; R) = (2W+ ( ^ ! ) :;t l( ^ ) . (2.12) 

The larger the rank m is, the faster the window function drops off with the wavenumber k, which reflects the smoothness 
on the edge of the kernel. It is desirable to have a degree of freedom to choose an appropriate rank m, because one can 
optimize the analysis depending on the nature of an individual data set. 

The observational data is the set of discrete density fluctuations dj. The cosmological models are characterized by a 
given set of parameters (6 a ). Besides CIm, ^a, and b, which is explicitly appeared in the expression of the correlation 
matrix, the power spectrum contains several other parameters when we characterize it by a standard cold dark matter 
model. The latter parameters are the normalization as, baryon mass fraction /b = Ob /51m, spectral index n, and Hubble 
constant H , etc. 

According to the Bayes' rule, the direct likelihood function of the cosmological parameters for a data set is given by 

£(e|o) = ^™, (2 , 3) 

where C(6\a) indicates the conditional likelihood function of the parameter set 8 = (61,62, ■ ■■ ,0m) given the date set 
a = (a\, a 2 , ■ ■ ■ , a N ), £(0) indicates the prior likelihood function of the parameter set 8, and so on. Normalization 
determines the denominator of the right hand side, and the prior likelihood £(8) comes from the other observations, or 
just a constant if we do not combine our prier knowledge on the cosmological parameters. The remaining factor C(a\6) is 
the distribution function of the data set given a cosmological model. In linear regime where the density fluctuations are 
considered to be random Gaussian, this function is simply the multi-variate Gaussian distribution function: 

£(a\8) = 1 = cxp ( --a T C~ l a | , (2.14) 

1 ' ^ v /(2 7 r) Jv dctC(0) V 2 J V ' 

where C -1 is the inverse matrix of the correlation matrix C = (Cy). The matrix C is the function of cosmological 
parameters 8 as noted above. Thus, calculation of the likelihood function is reduced to the calculation of equation (2.8) 
for a given survey geometry and selection function for various cosmological models. 

3. EVALUATION OF THE FISHER INFORMATION MATRIX 

The Fisher information matrix, 

/ d\n£(a\d)d\n£(a\8) \ I d 2 In £{a\8) \ 

FaP{0) {—Ma d^—/ = - \ M a dO P / ' (3 ' 1} 

where (■■■) = / C(a\9) ■ ■ ■ d N a represents the averaging over the possible data realization given the model parameters, is 
the key quantity in estimation theory, because the Cramer-Rao theorem (Kendall & Stuart 1969; Therrien 1992) states 
that an unbiased estimator constrains the model parameters with a minimum variance 

(A^A^)"^"^" 1 )^, (3.2) 

where F _1 is the inverse matrix of the Fisher matrix and A6 a = 9 a — 6 a is the deviation of the estimated value 9 a from 
the "true" value 6 a . In the above equation, the notation " > " means that the matrix whose a/3-elements are given 
by (A6 a A6p) — (F^ 1 ) a/3 is positive semidefinite. Moreover, the equation (3.2) is satisfied with equality if one uses the 
maximum likelihood estimate, which is obtained by a maximization of the likelihood function £(a\8) as in the previous 
section: 

(A8 a A8p) = (F ^ 1 ) a/3 (maximum likelihood estimate). (3.3) 

Thus, the inverse of the Fisher matrix gives us the expected constraint on the model parameters 8. A fiducial model for 
the "true" values of 8 arc required to calculate the Fisher matrix. 

In the linear regime where the distribution function is given by equation (2.14), the Fisher matrix is analytically reduced 
to the following form (see, e.g., Vogeley & Szalay 1996; Tegmark, Taylor & Heavens 1997): 

Using this equation, the calculation of the Fisher matrix is straightforward once the calculation of the correlation matrix 
for a given model is established in the previous section. 

The ellipsoid defined by an equation A8 T FA8 = v 2 defines the concentration ellipsoid which is interpreted as error 
bounds for a given observation. If the parameter likelihood function £(8\a) is Gaussian, the concentration ellipsoid with 
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a fixed value of v represents the va confidence level when one uses the maximum likelihood estimate. Although in general 
the parameter likelihood function is not Gaussian, the concentration ellipsoids still give the rough idea of expected error 
bounds with a given confidence level of v. 

When the parameter space is just two-dimensional, one can easily plot the concentration ellipses. The presentation of 
higher-dimensional space is not easy. We have at most seven parameters in our model: (Om, b, /b, h, erg, n). To depict 
the higher dimensional concentration ellipsoids, we can plot all the possible two-dimensional parameter spaces which are 
marginalized over all the other parameters. This gives the idea how one can impose the constraints in the multi-dimensional 
parameter space. Although the form of the parameter likelihood function is not known, we can assume it is approximated 
by a multi-variate Gaussian function so that C{9\a) ~ cxp(— ^A6 T FA6). In which case, the marginalization over 
the other parameters but two can be performed. For example, from the standard Gaussian integration, marginalized 
2-dimensional likelihood function for two parameters 9 a and 6p is given by 



C(6 a ,6p\a) ~ exp 



±(A0 a A0p) 



(F- 1 ) (F- 1 ) a 
( F -i) ( F -i) 




(3.5) 



up to normalization factor, where (F ) a /3 indicates the a/3-element of the inverse matrix of the original higher-dimensional 
Fisher matrix, and so on. Therefore, plotting the concentration ellipses in the marginalized parameter space is straightfor- 
ward: they are given by contours of the equation (3.5). Generally, concentration ellipsoids in the marginalized parameter 
sub-space are deduced from the sub-matrix of the full inverse Fisher matrix. 



4. RESULTS FOR THE SDSS GEOMETRY 

The largest redshift map which will be available in next several years is the SDSS survey. Therefore, it is of great 
interest how the maximum-likelihood method described in §2 can constrain various cosmological parameters. We have 
seen in §3 that it is straightforward to estimate the expected constraints on parameters given the survey geometry. We 
examine the maximum-likelihood method with the SDSS geometry as a typical application of our method. 

Redshift-space observations in the SDSS survey consist of three samples of objects (York et al. 2000). The first sample 
is the set of approximately 1,000,000 "Main Galaxies", which are selected by limiting an r'-band magnitude. The redshift 
of main galaxies extends up to z ~ 0.2. The second sample is the set of approximately 120,000 "Luminous Red Galaxies" 
(LRG's), which are selected on the basis of color and magnitude (Eisenstein ct al. 2001). Because of the clever selection 
criteria, the LRG's are essentially a distance-limited sample whose redshift extends up to z ~ 0.45. The third sample is 
the set of approximately 100,000 quasars, which are selected from color-color diagrams. The redshift of the quasar sample 
extends up to z ~ 3. 

In evaluating Fisher information matrices, the precise geometry of the survey is not needed, and one can approximate 
the survey geometry to reduce the numerically intense procedure of inverting huge matrices. We setup a generic cubic 
box with 1,000 cells for each samples. The redshift of the box is set to be a "mean" redshift of each sample: z = 0.1 for 
main galaxies, z = 0.3 for LRG's, and z = 1.8 for quasars. The precise value of this mean redshift is not a decisive factor 
in the Fisher matrix evaluations. The finitcness of the spatial volume and the number density of the objects are the main 
source of the statistical uncertainty, i.e., cosmic variance and shot noise. From the sky coverage of the three samples of 
the SDSS spectroscopic survey, which is about pi-steradian, and from the redshift distribution of each samples, one can 
obtain expected spatial volumes and number densities for each sample. 

Although the unit system Ho = 1 used in the previous sections simplifies the equations, it is not convenient in mentioning 
the clustering scales which become small figures in this system. Thus, we use a different unit system when we have to 
mention the length scale in the following paragraphs. Following Matsubara & Szalay (2001), we adopt a coordinate system 
in which radial distance s to the objects is measured by redshift times Hubble length: s = cz/H = (2997.9 h~ 1 Mpc) x z, 
directly observable quantity, instead of the comoving distance. The latter depends on cosmological models through 
parameters Om and f^A, which we would like to measure. We introduce the factor c/Hq in the definition of s simply in 
order to avoid the small figures when we mention 10-100 Mpc scales. This measure of the radial distance s is a linear 
extrapolation of the distance-redshift relation of the nearby universe. We save the unit /i _1 Mpc for the usual comoving 
coordinates, and we use a new notation /i _1 Mpc 2 for the measure s. The relation of the distance s and the comoving 
distance x is redshift-dependent and is given by a differential form, ds = H(z)/Hq ■ dx. In the unit system c = 1 and 
Hq — 1 which we used in previous sections, the distance s is simply the redshift z. 

We put the 1,000 cells in a generic cubic box of L 3 on regular 10 x 10 x 10 sites in redshift space. The top-hat kernel 
[to = in equation (2.11)] is used and the radius R of the kernel is just a half of the cell separation: R = L/20, so that 
there are no overlapping regions to ensure the independence of the cell volumes. The Fisher matrix is straightforwardly 
calculated for this generic cubic region. This generic box is smaller than the SDSS survey volume, therefore the Fisher 
matrix can be rescaled as if the whole survey volume consisted of many independent generic cubic boxes. One can easily 
see that the Fisher matrix of the composite sample is just given by a sum of Fisher matrices of the individual samples. 
Although we will miss the information from correlations between the boxes, the signals from the correlation at such large 
separations are very small. 

The box size L is chosen so that the cell radius be large enough to track the linear regime and not to have large shot 
noise. On the other hand, too large L makes the cosmic variance large. Therefore, we should take a smallest L which 
satisfies the above two conditions. We take L = 200/i~ 1 Mpc z for main galaxies and LRG's, and L = 1000 /i _1 Mpc 2 for 
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quasars. As a result, we have the number densities 35/(20 h 1 Mpc 2 ) 3 for main galaxies, 0.5/(20 h 1 Mpc z ) 3 , for LRG's, 
and 0.12/(100 ft^MpcJ 3 for QSO's. 

We assume fiducial values of model parameters such as (SJm, SI a, SIb/SIm, h, erg, n) — (0.3, 0.7, 0.13, 0.7, 1, 1). The bias 
parameters are expected to be different among each samples. We take fiducial values 6=1,2,2 for main galaxies, LRG's 
and quasars, respectively. For an evaluation of the power spectrum, we use an useful fitting formula 1 given by Eisenstcin 
& Hu (1998) instead of running a Boltzmann code such as the publicly available CMBfast 2 (Seljak & Zaldarriaga 1996). 

In Figure 1 we plot the expected errors from the fiducial model, when the mass density parameter SIm and the cosmo- 
logical constant parameter Ha are simultaneously evaluated. Three contours for each sample represent the concentration 
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Fig. 1. — The concentration ellipses for the cosmological parameters !Jm and Qa- Expected 68%, 95%, 99.7% confidence levels are shown. 
Dashed lines are for main galaxies, solid lines for luminous red galaxies (LRGs), and long-dashed lines for quasars (QSOs). 



ellipses of v = 1,2,3 which correspond to expected ler, 2a, 3a constraints in the parameter space, as described in the 
previous section. One can notice that the LRGs impose the best constraint on the Hm-^a plane. The main galaxies can 
only probe a shallow redshift region, so that the cosmological constant does not strongly contribute to the redshift-space 
clustering, and therefore is less constrained. Although the QSOs can probe deep redshifts, they are too sparse to detect 
the redshift-space clustering itself. The LRGs nicely have intermediate properties and are balanced between the depth 
and the density of the survey objects. 

In Figure 2 we plot the expected errors when three parameters, SIm, SI a, SIb/SIm are simultaneously evaluated. In each 
panel, marginalized errors are plot for each set of two parameters, as described in the previous section. In this case, the 
QSOs do not constrain each parameter well, because of their sparseness. Main galaxies and LRGs can impose similar 
constraints for the baryon fraction and the mass density parameter. 

In Figure 3 we plot the expected errors when four parameters, SIm, SIa, SIb/SIm, b are simultaneously evaluated. The 
results of the QSOs are not plotted just because the concentration ellipses are too large and comparable to the bounding 
boxes. In this Figure, the bias parameter b is normalized by the fiducial values frfid = 1,2,2 for each sample, just for 
convenience. The bias parameters in this Figure are assumed to be constants without redshift-evolution. If this is not the 
case, one can divide the sample into redshift bins, and the expected errors of bias parameter will be worse, according to 
the decrease of the sample volume. However, the error estimates of the other parameters in this Figure are approximately 
independent of the redshift evolution of the bias due to the marginalization over the bias parameter. This is because the 
information from cross terms in the redshift bins of ~ 0.1 are already ignored in our analysis, and the likelihood function 
with a fixed bias parameter is not too sensitive to the bias parameter Matsubara & Szalay (2001). 

In Figure 4 we plot the expected errors, while simultaneously evaluating five parameters, SIm, SI a, ^b/^m 7 08 j and b. 
The bias parameter b and the normalization as are strongly correlated because their main contribution to the correlation 
matrix is through the amplitude of the correlations which is proportional to b 2 a% 2 . In Figure 5 we plot the expected 
errors, while simultaneously evaluating another set of five parameters, SIm, SIa, SIb/SIm, n, and b. 

Any similar plot can be generated for an arbitrary set of simultaneously determined parameters. There are (,) + 
(2) + • • • + (7) = 127 ways to choose which parameters are fixed, so that we need 122 figures more to be complete. 
Instead we have shown only several representative cases above. In Table 1 are given all the elements of our estimated 

1 http:/ /background. uchicago.edu/~whu/transfer/transferpage. html 

2 http://physics.nyu.edu/matiasz/CMBFAST/cmbfast.html 
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Fig. 2. — The marginalized 2D concentration ellipses when three cosmological parameters are simultaneously determined. 
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Fig. 3. — The marginalized 2D concentration ellipses when four cosmological parameters are simultaneously determined. 



seven-dimensional Fisher matrices from which all the information can be extracted according to the way described 
the previous section. The listed matrix elements are normalized by fiducial values as F a p/(6 a 0p)fid- Thus, the inverse 
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square-root of each diagonal element gives the fractional error relative to the fiducial value of each parameter when a single 
parameter is determined knowing the other parameters. The square-root of the diagonal elements of the inverse of any 
sub-matrix gives the fractional error of the parameters when corresponding parameters are simultaneously determined. 

Basically, the LRG's can impose constraints on each parameter comparable to, or even better than main galaxies. The 
reason that the main galaxies do not work so well (even though their number is 10 times larger than that of LRG's) is 
that we only use the linear regime of the clustering. Therefore, the depth of the LRG sample is more critical for the 
parameter estimation than the higher density of the main galaxies. The quasars are hopeless when many parameters are 
simultaneously determined, because of their extreme sparseness. In this way, our analysis gives the idea how the dense 
sampling affects cosmological parameter estimations. 

5. CONCLUSIONS 

In this paper, we described a maximum-likelihood method to determine the cosmological parameters from apparent 
redshift-space clustering. We took advantage of the analytical formula for linear two-point correlations in redshift space 
with cosmological distortions. We introduced an accurate approximation in calculating the elements of the correlation 
matrix among cells in redshift-space. This step makes the method practically feasible in spite of the large dimensionality 
of the correlation matrices. 

In the maximum-likelihood method, one can easily evaluate the expected parameter estimation errors in any sample 
from the Fisher information matrix. We have calculated the seven-dimensional Fisher matrix for three types of objects 
in the SDSS survey: main galaxies, LRGs and quasars. To illustrate the behavior of the multi-dimensional Fisher matrix 
we have used concentration ellipses in marginalized two-dimensional parameter space. 

In this paper, we divided the survey volume into generic boxes in order to simplify the Fisher matrix estimation. We 
ignored the correlations between these sub-regions, so the constraints will improve somewhat if those correlations are 
properly included. However, the inversion of the resulting huge matrices can become extremely time-consuming. The use 
of the Karhunen-Loeve (KL) transform is a practical strategy in this case. Such methods can also be used in a targeted 
data-compression role to find linear combinations of counts which retain as much information about the parameters as 
possible (Vogeley & Szalay 1996; Tegmark, Taylor & Heavens 1997; Matsubara, Szalay & Landy 2000; Taylor et al. 2001). 



9 




The choice of the cell radius is somewhat arbitrary in this work. We choose the spherical cell with radius of 10 h~ 1 Mpc 
for galaxies and LRGs, which is the border of the linear regime. With larger cell radius, the validity of the linear 
theory increases and the shot noise is reduced. The cosmic variance, however, increases with cell radius. The parameter 
estimation is dominated by the highest signal-to-noise modes, which are at large wavelength, in particular for the case of 
the LRG sample. The high frequency modes close to pixel scales mostly contain shot noise after the KL transformation. 
As a result, we believe our conclusions are not sensitive to the choice of the cell radius. A fully accurate determination of 
the optimal choice of the cell radius depends on the behavior of the nonlinear effects, so that a comparison with numerical 
simulation is needed, beyond the scope of the current work. 

We have considered three subsets of the SDSS redshift data, spanning a wide range of depth, sampling density and 
intrinsic clustering strength. We found, that for measuring cosmological parameters in the linear regime there is a clear 
optimum, represented by the intcrmcdiate-redshift LRG's. The low spatial density of quasars is not overcome by their 
much larger depth, and the relatively small depth of the main SDSS galaxies is not compensated by their high sampling 
density - the redshift is not high enough to test curvature, and their cosmic variance is too large. The LRG sample, 
much smaller in numbers than the main sample, and much shallower than the quasars is an excellent compromise between 
sampling density and cosmological depth. The constraints derived from the LRGs are much tighter than for the other 
two samples. 

The advantage of these intermediate-redshift objects, and the logic behind this optimum goes beyond the SDSS. In 
designing future redshift surveys, it is important to find the right balance between the density of objects and the survey 
depth. Their interplay can be quite complex, as we have shown here. The relation between accuracy and sky coverage is 
simple and can be estimated analytically. 
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couragement of Young Scientists, 13740150, 2001. AS acknowledges support from grants NSF AST-9802 980 and NASA 
LTSA NAG-53503. 
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Table 1 

Normalized seven-dimensional Fisher Information matrices F al 3/(9 a 9f3)fid for the SDSS main galaxies, LRG's, and 

QSO's. 
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